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Preface 


The A-calculus and combinatory logic are two systems of logic which 

can also serve as abstract programming languages. They both aim to 
describe some very general properties of programs that can modify other 
programs, in an abstract setting not cluttered by details. In some ways 
they are rivals, in others they support each other. 

They support a sound ground for computational thinking. 
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A. A-calculus 


The A-calculus was invented around 1930 by an American logician 

Alonzo Church, as part of a comprehensive logical system which included 
higher-order operators (operators which act on other operators). In fact 

the language of A-calculus, or some other essentially equivalent notation, 
is a key part of most higher-order languages, whether for logic or for 
computer programming. Indeed, the first uncomputable problems to be 
discovered were originally described, not in terms of idealized computers 
such as Turing machines, but in A-calculus. 
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B. Combinatory Logic 


Combinatory logic has the same aims as A-calculus, and can express 

the same computational concepts, but its grammar is much simpler. Its 
basic idea is due to two people: Moses Sch onfinkel, who first thought of 
itin 1920, and Haskell Curry, who independently re-discovered it seven 
years later and turned it into a workable technique. 


Both the A-calculus and combinatory logic serve as one of the main toools 
for computational thinkng. 


C. What is computational thinking? 


Computational thinking is the process in 
which you creatively apply the four-stage 
problem-solving cycle to the ideas, 
problems, and opportunities you 
encounter to develop and test solutions. 
The emphasis is on learning to take real 
life situations and abstract away—often 
from programs—so that the computer can 
compute the answer.. 
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D. Where does computational thinking shows 
itself up? 


Computational thinking (CM) applies to 
everyone. This allows managers, decision 
makers and administrators to think 
horizontally in order to generate a wider 
range of solutions and adapt their 
strategy to become more competitive. 
This allows technical teams to optimize 
their methods based on the latest 
calculations, providing an innovative 
result for analysis. 


E. 4-stages cycle of solving problems 


1 IDENTIFY QUESTIONS 


* Consider the scope and details of the problem 
by identifying manageable issues to address. 
Determine the information you have or will need 
to obtain in order to solve the problem. 


3 CALCULATE ANSWERS 


* Turn an abstract question into an abstract 
answer using computational power, usually with 
computers. Identification and elimination of 
operational problems during calculations. 


2 ABSTRACT TO A COMPUTABLE 
FORM 


e Transform the question into an abstract, 
precise form - code, diagrams, or calculation 
algorithms. Select the concepts and tools that 
will be used to arrive at the solution. 


4 INTERPRET THE RESULTS 


* Таке an abstract answer and interpret the 
results by recontextualizing them within your 
original questions and testing them 
skeptically. Take another turn to correct or 
clarify. 
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F. Re-decision with new ideas 


= 
юэ 


Often the answer to one question can be 
used to re-solve, repeating the four-step 
process with new ideas. Thus, 
computational thinking can be thought of 
as a spiral, consisting of a roadway of 
four steps, repeated sequentially until you 
reach a solution suitable for the original 
goal. 
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e цаас 1. The A-calculus 
1A Introduction 

What is usually called A-calculus is a collection of several formal 
systems, based on a notation invented by Alonzo Church in the 
1930s. They are designed to describe the most basic ways that 
operators or functions can be combined to form other operators. 
In practice, each A-system has a slightly different grammatical 
structure, depending on its iniended use. Some have extra 
constant-symbols, and most have built-in syntactic restrictions, for 
example type-restrictions. 
But to begin with, it is best to avoid these complications; hence 
the system presented in this chapter will be the ‘pure’ one, which is 
syntactically the simplest. 
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1A Introduction 


To motivate the A-notation, consider the everyday mathematical 
expression 'x - y'. This can be thought of as defining either a 
function f of x or a function g of y; 
f(x)=x-y, g(y7x-y. 

or 

f:x5 x-y 4:у-хХ-у. 
And there is a need for a notation that gives f and g different names 
in some systematic way. In practice, mathematicians usually avoid 
this need by various 'ad-hoc' special notations, but these can get 
very clumsy when higher-order functions are involved (functions 
which act on other functions). 
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1A Introduction 


Church's notation is a systematic way of constructing, for each 
expression involving ‘x’, a notation for the corresponding function 
of x (and similarly for ‘y’, etc.). Church introduced ‘A’ as an auxiliary 
symbol and wrote 
f=Ax.x-y g^My.x-y. 
For example, consider the equations 
f(0)20-y, f(1)21-y. 
In the A-notation these become 
(Лх.х- у)(0) = 0 - у, (Ax.x-y)y(1)*21-y. 
These equations are clumsier than the originals, but do not be put 
off by this; the A-notation is principally intended for denoting higher- 
order functions, not just functions of numbers, and for this it turns 
out to be no worse than others. This notation is systematic, and 
more suitable for incorporation into a programming language. 
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1A Introduction 


The A-notation can be extended to functions of more than one 
variable. 
For example, the expression 'x - y' determines two functions h 
and k of two variables, defined by 

h(x,y)=x-y, Қ(у, х) =х -у. 
These сап be denoted by 

h=Axy.x-y, k=Ayx.x-y. 
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1A Introduction 


However, we can avoid the need for a special notation for functions 
of several variables by using functions whose values are not 
numbers but other functions. 
For example, instead of the two-place function h above, 
consider the one-place function h* defined by 
h* = Ax.(Ay.x - у). 
For each number a, we have 
h* (a)=Ay.a-y; 
hence for each pair of numbers a, b, 
(h* (3) (b) = (Ay . a ~ y)(b) 
-a-b 
= h(a, b). 
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1A Introduction 


Thus h* can be viewed as ‘representing’ h. For this reason, we shall 
largely ignore functions of more than one variable in this book. 
From now on, 'function' will mean 'function of one variable' unless 
explicitly stated otherwise. (The use of h* instead of h is usually 
called currying.) 

Having looked at A-notation in an informal context, let us now 
construct a formal system of A-calculus. 
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1A Introduction 


Definition 1.1 (A-terms) Assume that there is given an infinite 
sequence of expressions vO , v00, v000,.... called variables, and а 
finite, infinite or empty sequence of expressions called atomic 
constants, different from the variables. (When the sequence of 
atomic constants is empty, the system will be called pure, otherwise 
applied.) The set of expressions called A-terms is defined nductively 
as follows: 


(a) all variables and atomic constants are A-terms (called atoms); 
(b) if M and N are any ^-terms, then (MN) is a A-term (called an 
application); 

(c) if M is any A-term and х is any variable, then (Ax.M) is a A-term 
(called an abstraction). 
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1A Introduction 


Example 1.2 (Some A-terms) 
(а) (AvO . (v0v00)) is a A-term. 

If x, y, z are any distinct variables, the following are A-terms: 

(b) (Ax. (xy)), (d) (x (Ax. (Ax.x))), 

(c) ((Ay.y (Ax. (ху))), (e) (Ax. (yz)). 
In (d), there are two occurrences of Ax in one term; this is allowed 
by Definition 1.1, though not encouraged in practice. Part (e) shows 
a term of form (Ax.M) such that x does not occur in M; this is called 
a vacuous abstraction, and such terms denote constant functions 
(functions whose output is the same for all inputs). 
By the way, the expression ‘A’ by itself is not a term, though it may 
occur in terms; similarly the expression 'Ax' is not a term. 


1A Introduction 


Notation 1.3 Capital letters will denote arbitrary A-terms in this 
chapter. 
Letters ‘x’, ‘y’, ‘2°, ‘u’, ‘v’, ‘w will denote variables throughout the 
book, and distinct letters will denote distinct variables unless stated 
otherwise. 
Parentheses will be omitted in such a way that, for example, “ММРО” 
Will denote the term (((MN)P)Q). (This convention is called 
association to the left.) 
Other abbreviations will be 
Ax.PQ for (Ax. (PQ)), 
Ax1x2 ... xn.M for (Ax1 . (Ax2 . (. . . (Axn.M).. .))). 
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1A Introduction 


Syntactic identity of terms will be denoted by ‘=’; in other words 
M=N 
will mean that M is exactly the same term as N. (The symbol ‘=’ will 
be used in formal theories of equality, and for identity of objects that 
are not terms, such as numbers.) 
It will be assumed of course that 
if MN = РО then M = P and N = О, 
and if Ax.M = Ay.P then x = y and М = Р. 
It will also be assumed that variables are distinct from constants, 
and applications are distinct from abstractions, etc. Such 
assumptions are always made when languages are defined, and will 
be left unstated in future. 
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1A Introduction 


The cases k = 0, п = 0 in statements like 
‘P=MN1...Nk (к> 0)’ 


or 
‘T has form Ax1... xn .PQ (n 2 0)’ 
will mean 
‘PZM 
or 


‘T has form PQ’. 
‘X will often be used carelessly to mean ‘A-calculus in general’. 
‘Iff * will be used for ‘if and only if’. 
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1A Introduction 


Exercise 1.4 » 
Insert the full number of parentheses and A's into the 
following abbreviated A-terms: 
(a) xyz(yx), (d) (Au.vuu)zy, 
(b) Ax.uxy, (e) ux(yz)(Av.vy), 
(c) Au.u(Ax.y), (f) (Axyz.xz(yz))uvw. 
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1A Introduction 


Informal interpretation 1.5 
Not all systems based on A-calculus use all the terms allowed by 


Definition 1.1, and in most systems, some terms are left 
uninterpreted, as we shall see later. But the interpretations of 
those A-terms which are interpreted may be given as follows, 
roughly speaking. 


In general, if M has been interpreted as a function or operator, then 
(MN) is interpreted as the result of applying M to argument N, 
provided this result exists. 


A term (Ax.M) represents the operator or function whose value at an 
argument N is calculated by substituting N for x in M. 
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1A Introduction 


For example, Ax.x(xy) represents the operation of applying a 

function twice to an object denoted by y; and the equation 
(Ax.x(xy))N = N(Ny) 

holds for all terms N, in the sense that both sides have the same 

interpretation. 


For a second example, Ax.y represents the constant function that 
takes the value y for all arguments, and the equation 

(Ax.y)N = y 
holds in the same sense as before. 


45 НИЯУ 
© халаг 1. The A-calculus 
1B Term-structure and substitution 


The main topic of the chapter will be a formal procedure for 
calculating with terms, that will closely follow their informal 
meaning. But before defining it, we shall need to know how to 
substitute terms for variables, and this is not entirely 
straightforward. 
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1B Term-structure and substitution 


The main topic of the chapter will be a formal procedure for 
calculating with terms, that will closely follow their informal 
meaning. But before defining it, we shall need to know how to 
substitute terms for variables, and this is not entirely 
straightforward. 

Definition 1.6 The length of a term M (called Igh(M)) is the total 
number of occurrences of atoms in M. In more detail, define 
(a) Igh(a) = 1 for atoms a; 

(b) Igh(MN) = Igh(M) + Igh(N); 

(c) Igh(Ax.M) = 1+Ilgh(M). 

The phrase ‘induction on M’ will mean ‘induction on Igh(M)’. 
For example, if M = x(Ay.yux) then Igh(M) = 5. 
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1B Term-structure and substitution 


Definition 1.7 For A-terms P and Q, the relation P occurs in Q (or P 
is a subterm of Q, or Q contains P) is defined by induction on Q, 
thus: 

(a) P occurs in P; 

(b) if P occurs in M or in N, then P occurs in (MN); 

(c) if P occurs in M or P € x, then P occurs in (Ax.M). 

The meaning of 'an occurrence of P in Q' is assumed to be 
intuitively clear. For example, in the term ((xy)(Ax. (xy))) there are 
two occurrences of (xy) and three occurrences of x. 
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1B Term-structure and substitution 


Definition 1.9 (Scope) For a particular occurrence of Ax.M in a term 
P, the occurrence of M is called the scope of the occurrence of Ax on 
the left. 

Example 1.10 Let 

Р = (Ay.yx(Ax.y(Ay.z)x))vw. 

The scope of the leftmost Ay in P is yx(Ax.y(Ay.z)x), the scope of Ax 
is y(Ay.z)x, and that of the rightmost Ay is z. 


1B Term-structure and substitution 


Definition 1.11 (Free and bound variables) An occurrence of a 
variable x in a term P is called 

* bound if it is in the scope of a Ax in P, 

* bound and binding, iff it is the x in Ax, 

* free otherwise. 

If x has at least one binding occurrence in P, we call x a bound variable 
of P. If x has at least one free occurrence in P, we call x a free variable 
of P. The set of all free variables of P is called 

FV(P). 

A closed term is a term without any free variables. 


D бу 


1B Term-structure and substitution 


Examples Consider the term xv(Ayz.yv)w: this is really 

(xv) (Ay. (Az. (yv)))w, 
and in it the x on the left is free, the leftmost v is free, the leftmost y 
is both bound and binding, the only z is the same, the rightmost y is 
bound but not binding, the rightmost v is free, and the only w is free. 
In the term P in Example 1.10, all four y's are bound, the leftmost 
and rightmost y's are also binding, the left-hand x is free, the central x 
is bound and binding, the right-hand x is bound but not binding, and 
Z, V, W are free; hence 

FV(P) = {х, 2, v, w}. 
Note that x is both a free and a bound variable of P; this is not normally 
advisable in practice, but is allowed in order to keep the definition of 
‘A-term’ simple. 
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1B Term-structure and substitution 


Definition 1.12 (Substitution) For any M, N, x, define [N/x]M to be the 
result of substituting N for every free occurrence of x in M, and 
changing bound variables to avoid clashes. The precise definition is by 
induction on M, as follows. 

(a) [N/x] х xz N; 

(b) [N/x] a = a for all atoms a = x; 

(c) [N/x](PQ) = [N/x]P [N/x]Q; 

d) [N/x](Ax.P) = А.Р; 

е) [N/x](Ay.P) = Ay.P if x e FV(P); 

f) [N/x](Ay.P) = Ay. [N/x]P if x є FV(P) and y € FV(N); 

0) [N/x](Ay.P) = Az. [N/x][Z/y]P if x e FV(P) and y € FV(N). 

In 1.12(e)-(g), y = x; and in (9), z is chosen to be the first variable 

€ FV(NP).) 


( 
( 
( 
( 
( 
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1B Term-structure and substitution 


Remark 1.13 The purpose of clause 1.12(g) is to prevent the intuitive 
meaning of [N/x](Ay.P) from depending on the bound variable y. For 
example, take three distinct variables w, x, y and look at 

[w/x](Ay.X). 
The term Ay.x represents the constant function whose value is always 
x, so we should intuitively expect [w/x](Ay.x) to represent the constant 
function whose value is always w. And this is what we get; by 1.12(f) 
and (a) we have 
[w/x](Ay.x) = Ay.w. 
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1B Term-structure and substitution 


Exercise 1.14 « Evaluate the following substitutions: 
) [(uv)/x] (Ay.x(Aw.vwx)), 
[(Ay.xy)/x] (Ay.x(AX.x)), 
(Ay.vy)/x] (y (Av.xv)), 


(a 
(b) 
(c) [ 

(а) [(uv)/x] (Ax.zy). 
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1B Term-structure and substitution 


Definition 1.17 (Change of bound variables, congruence) Let 

a term P contain an occurrence of Ax.M, and let y є FV(M). The act 

of replacing this Ах.М by 

Лу. |У/х|М 

is called a change of bound variable or an a-conversion in P. Iff P can 
be changed to Q by a finite (perhaps empty) series of changes of bound 
variables, we shall say P is congruent to Q, or P a-converts to Q, or 

P =a О. 
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1B Term-structure and substitution 


Example 1.18 

Axy.x(xy) = Ax. (Ay.x(xy)) 
=a Ax. (Av.x(xv)) 
=a Au. (Av.u(uv)) 
= Auv.u(UV). 
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1B Term-structure and substitution 


Remark 1.23 (Simultaneous substitution) It is possible to modify 
the definition of [N/x]M in 1.12 to define simultaneous substitution 
[N1/x1,..., Nn/xn JM 

for n 2 2. We shall not need the details here, as only very simple special 
cases of simultaneous substitution will be used in this book. A full 
definition is omitted; the key is to first change any bound 
variables in M that might cause problems. 
By the way, [N1/x1, ... , Nn/xn JM should not be confused with the 
result of n successive substitutions 

[N1/x1 ](... (INn/xn ]M) . . .). 
For example, taken = 2, N1 = и, М = x1 , M = х1х2 ; then 
[u/x1 ]([x1/x2 |М) = [u/x1 ](x1x1 ) = uu, 
[u/x1, x1/x2 JM = ux1. 
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1C B-reduction 


The topic of this section is the calculation procedure that lies at the 
heart of A-calculus and gives it its power. 

A term of form (Ax.M)N represents an operator Ax.M applied to an 
argument N. In the informal interpretation of Ax.M, its value when 
applied to N is calculated by substituting N for x in M, so (Ax.M)N 
can be ‘simplified’ to [N/x]M. This simplification-process is captured 
in the following definition. 
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1C B-reduction 


Definition 1.24 (B-contracting, B-reducing) Any term of form 

(Ax.M)N 
is called a B-redex and the corresponding term 

[N/x]M 
is called its contractum. Iff a term P contains an occurrence of 
(Ax.M)N and we replace that occurrence by [N/x]M, and the result is 
P, we say we have contracted the redex-occurrence in P, and P B- 
contracts to P or 

P - P. 
Iff P can be changed to a term Q by a finite (perhaps empty) series of 
В-сопігасіїопѕ and changes of bound variables, we say P p-reduces 
to Q, or 

PRQ. 


1C B-reduction 


Definition 1.26 A term Q which contains no B-redexes is called a 
8-normal form (or a term in B-normal form or just a B-nf ). The class 
of all B-normal forms is called В-пї or Ap-nf. If a term P B-reduces to a 
term Q in B-nf, then Q is called a B-normal form of P. 

The ‘P’ may be omitted when this causes no confusion. 
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1C B-reduction 


Example 1.27 
(a) In 1.25(c), zv is a B-normal form of (Ax. (Ay.yx)z)v. 
(b) Let L = (Ax.xxy)(Ax.xxy). By 1.25(е) we have 
L =B Ly 18 Lyy =B... 
This sequence is infinite and there is no other way that L can be 
В-гедисеа, so L has no B-normal form. 
(c) Let P = (Au.v)L for the above L. Then P can be reduced in two 
different ways (at least), thus: 
(i) P = (Au.v)L 2B [L/u]v = v; 
(ii) P =B (Au.v)(Ly) by contracting L 
=B (Au.v)(Lyy) by contracting L again... etc. 
o0 P has a normal form v, but also has an infinite reduction. 
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1C B-reduction 


Example 1.27 (cntd) 

(d) The term (Ax.xx)(Ax.xx) in 1.25(d) is usually called O. It is 
not a normal form; indeed it does not reduce to one (because it 
reduces always to itself). But O is 'minimal' in the sense that it 
cannot be reduced to any different term. (In [Ler76] it is proved 
that no other redex is minimal in this sense.) 


e буун 


1C В-гедис їоп 


Remark 1.29 Some terms can be reduced in more than one way. 
One such term, from Example 1.25(c), is (Ax. (Ay.yx)z)v. It has two 
reductions: 
(Ax. (Ay.yx)z)v =B (Ay.yv)z by contracting (Ax. (Ay.yx)z) v 

- zv by contracting (Ay.yv)z; 
(Ах. (Ay.yx)z)v =B (Ax.zx)v by contracting (Ay.yx)z 

-p zv. 
In this case both reductions reach the same normal form. Is this 
always true? Certainly, for any system claiming to represent computation the 
end-result should be independent of the path. So if this property failed for B- 
reduction, any claim by A-calculus to be like a programming language would fail 
from the start. The Church-Rosser theorem below will show that the normal form 
of a term is indeed unique, provided we ignore changes of bound variables. It will 
have many other applications too; in fact it is probably the most often quoted 
theorem on A-calculus. 
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1C B-reduction 


Theorem 1.32 (Church-Rosser theorem for B) If P =В M and 
P =6 N, then there exists a term T such that 
M =В T and N -P Т. 


Note The property described in the Church—Rosser theorem, that if a 
term can be reduced to two different terms then these two terms can 
be further reduced to one term, is called confluence. The theorem 
states that B-reduction is confluent. 

As mentioned before, the most important application of this theorem 
is to show that a computation in À-ca/culus cannot produce two 
essentially different results. 
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Heduction is non-symmetric, but it generates the following symmetric 
relation. 
Definition 1.37 We say P is B-equal or B-convertible to Q (notation 
P =B Q) iff Q can be obtained from P by a finite (perhaps empty) 
series of B-contractions and reversed В-сопігасіїопѕ and changes of 
bound variables. That is, P 2p Q iff there exist PO,..., Pn (n2 0) 
such that 
(vi <n - 1) (Pi =B Pi*1 or Pi* 1 2B Pi or Pi za Рі+1), 

PO = P, Pn = О. 
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Remark 1.43 (Al-terms) Terms without normal forms are like 
expressions which can be computed for ever without reaching a 
result. It is natural to think of such terms as meaningless, or at least 
as carrying less meaning than terms with normal forms. But it is 
possible for a term T to have a normal form, while one of its subterms 
has none. 

(An example is T = (Au.v)O, where Q = (Ax.xx)(Ax.xx) from 1.25(d).) 
Thus a meaningful term can have a part which might be thought 
meaningless. 
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10 B-equality 


Exercise 1.44 (Extra practice) 
(a) Insert all missing parentheses and A's into the following abbreviated A-terms: 
(i) xx(xxx)x, (ii) VW(Axy. их), 

(iii) (Axy.x)uv, (Iv) w(Axyz.xz(yz))uv. 

(b) Mark all the occurrences of xy in the following terms: 

(i) (Axy.xy)xy, (ii) (Axy.xy)(xy), (iii) Лху.ху(ху). 

(c) Do any of the terms in (a) or (b) contain any of the following 
terms as subterms? If so, which contains which? 

(1) Ay-xy, (ii) y(xy), (iii) Axy.x. 

(d) Evaluate the following substitutions: 

(1) [vw/x] (x(Ay.yx)), (ii) [VW/x] (x(Ax.yx)), 

(11) [ux/x] (x(Ay-yx)), (Iv) [uy/x] (x(Ay.yx)). 

(e) Reduce the following terms to B-normal forms: 

(i) (Axy.xyy)uy, (ii) (Axy. yx)(uv)zw, 

(iii) (Axy.x)(Au.u), (iv) (Axyz.xz(yz)) (Лими). 
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Further reading 

Before we move on, here are a few suggestions for supplementary 
reading. 

All of them are general works on A-calculus; books on special 
subtopics will be mentioned in later chapters. 

First, the Internet is a useful source: typing ‘lambda calculus’ into a 
search engine will show plenty of online introductions and 
Summaries, as well as links to more specialized subtopics. 

Also, most introductions to functional programming contain at least 
a quick introduction to A-calculus. 
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1D B-equality 


Further reading 


[HinSel08] R. Hindley, J. Seldin 
Lambda-Calculus and Combinators, an Introduction. 
Cambridge University Press 2008, in English. 


[Wol03] V. E. Wolfengagen. 

Combinatory Logic in Programming. Computations with objects 
through examples and exercises. 

JurlnfoR Ltd., Moscow, Russia, 2003. 2nd edn., in English. 


[Wol04] V. E. Wolfengagen. 
Methods and Means for Computation with Objects.JurlnfoR Ltd., 
Moscow, Russia, 2004. In Russian. 


2 Combinatory logic 
2А Introduction to CL 


oystems of combinators are designed to do the same work as 
systems of A-calculus, but without using bound variables. In fact, the 
annoying technical complications involved in substitution and a- 
conversion will be avoided completely in the present chapter. 
However, for this technical advantage we shall have to sacrifice the 
intuitive clarity of the A-notation. 
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2А Introduction to CL 


To motivate combinators, consider the commutative law of addition 
in arithmetic, which says 
(VX, у) х+у=у + х. 

The above expression contains bound variables х and ‘y’. But these 
can be removed, as follows. We first define an addition operator A by 
A(x, y) = X + y (for all x, y), 

and then introduce an operator C defined by 
(C(f))(x, y) = f(y, x) (for all f, x, y). 
Then the commutative law becomes simply 
А - С(А). 
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2А Introduction to CL 


The operator C may be called a combinator; other examples of such 
operators are the following: 

B, which composes two functions: (B(f, g))(x) = f(g(x)); 
B',areversed composition operator: (B' (f, g))(x) = g(f(x)); 

I, the identity operator: I(f) = f; 

K, which forms constant functions: (K(a))(x) = a; 

S, a stronger composition operator: (S(f, g))(x) = f(x, g(x); 

W, for doubling or ‘diagonalizing’: (W(f))(x) = f(x, x). 
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2А Introduction to CL 


Definition 2.1 (Combinatory logic terms, or CL-terms) 

Assume that there is given an infinite sequence of expressions м0 , 
v00, v000, .. . called variables, and a finite or infinite sequence of 
expressions called atomic constants, including three called basic 
combinators: I, K, S. (If I, K and S are the only atomic constants, 
the system will be called pure, otherwise applied.) The set of 
expressions called CL-terms is defined inductively as follows: 

(a) all variables and atomic constants, including 1, К, S, are CLterms; 
(b) if X and Y are CL-terms, then so is (XY ). 
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2А Introduction to CL 


An atom 16 a variable or atomic constant. A non-redex constant is an 
atomic constant other than I, K, S. A non-redex atom is a variable or 
a non-redex constant. A closed term is a term containing no 
variables. A combinator is a term whose only atoms are basic 
combinators. (In the pure system this is the same as a closed term.) 
Examples of CL-terms (the one on the left is a combinator): 
((S(KS))K), ((S(KvO ))((SK)K)). 
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2А Introduction to CL 


Notation 2.2 Capital Roman letters will denote CL-terms in this 
chapter, and ‘term’ will mean 'CL-term'. 

‘CL will mean ‘combinatory logic’, i.e. the study of systems of 
CLterms. 

(In later chapters, particular systems will be called ‘CLw’, ‘CLE, 

etc., but never just ‘CL’.) 

The rest of the notation will be the same as in Chapter 1. In particular 
x’, Y, Z, U, №, W will stand for variables (distinct unless otherwise 
stated), and ‘=’ for syntactic identity of terms. Also parentheses will 
be omitted following the convention of association to the left, so that 
(((UV )W)X) will be abbreviated to UVWX. 
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2A Introduction to CL 


Definition 2.3 The length of X (or Igh(X)) is the number of 
occurrences of atoms in X: 

(a) Igh(a) = 1 for atoms a; 

(b) Igh(UV) = Igh(U) + Igh(V ). 

For example, if X = xK(SSxy), then Igh(X) = 6. 
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2А Introduction to CL 


Definition 2.4 The relation X occurs in Y , or X is a subterm of Y , 
is defined thus: 

(a) X occurs in X; 

(b) if X occurs in U or in V , then X occurs in (UV ). 

The set of all variables occurring in Y is called FV(Y ). (In CL-terms 
all occurrences of variables are їгее, because there is no À to bind 
them.) 

Example 2.5 Let Y = K(xS)((xSyz)(Ix)). Then xS and x occur in Y 
(and xS has two occurrences and x has three). Also 

FV(Y ) = {x, y, Z}. 
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2А Introduction to CL 


Definition 2.6 (Substitution) [U/x]Y is defined to be the result of 
substituting U for every occurrence of x in Y : that is, 

(a) [U/x]x = U, 

(b) [U/x]a = a for atoms a = x, 

(c) [U/x](VW) = ([U/X]V [U/X]W). 


For all U1, ..., Un and mutually distinct x1, ..., xn , the result of 
simultaneously substituting U1 for x1 , U2 for x2,..., Un for xn in Y 
is called 

[U1/x1, .. . , Un/xn JY. 

Example 2.7 


(a) [(SK)/x](yxx) = y(SK)(SK), 
(b) [(SK)/x, (KI)/y](yxx) = KI(SK)(SK). 
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2B Weak reduction 


In the next section, we shall see how I, K and S can be made to play 
a role that is essentially equivalent to ‘A’. We shall need the following 
reducibility relation. 

Definition 2.9 (Weak reduction) Any term IX, KXY or SXYZ 

is called a (weak) redex. Contracting an occurrence of a weak redex 
in a term U means replacing one occurrence of 

IX by X, or 

KXY by X, or 

SXY Z by XZ(Y Z). 


2B Weak reduction 


Iff this changes U to U', we say that U (weakly) contracts to U', or 
U =w U’ 


Iff V is obtained from U by a finite (perhaps empty) series of weak 
contractions, we say that U (weakly) reduces to V , or 
U =w V. 
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2B Weak reduction 


Example 2.11 Define B 2 S(KS)K. Then BXY Z w X(Y Z) for all 
terms X, Y and Z, since 

BXY Z = S(KS)KXYZ 

-w KSX(KX)Y Z by contracting S(KS)KX to KSX(KX) 

=w S(KX)Y Z by contracting KSX to S 

=w KXZ(Y Z) by contracting S(KX)Y Z 

=w X(Y Z) by contracting KXZ. 
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2B Weak reduction 


Example 2.12 Define C = S(BBS)(KK). Then CXY Z w XZY , since 

CXY Z 8 S(BBS)(KK)XY Z 

-w BBSX(KKX)Y Z by contracting S(BBS)(KK)X 

-w BBSXKY Z by contracting KKX 

-w B(SX)KY Z by 2.11 

-w SX(KY )Z by 2.11 

-w XZ(KY Z) by contracting SX(KY )Z 

=w XZY by contracting KY Z. 

Incidentally, in line 4 of this reduction, a redex KY Z seems to occur; 
but this is not really so, since, when all its parentheses are inserted, 
B(SX)KY Z is really ((((B(SX))K)Y )Z). 


Om 2 


2B Weak reduction 


Definition 2.18 (Abstraction) For every CL-term M and every 
variable x, а CL-term called [x].M is defined by induction on M, thus: 
(a) [x]. M = KM if x not € FV(M); 

(b) [x].x = l; 

(c) [x]. Ux = U if x not e FV(U); 

(f) [x]. UV = S([x]. U)([x].V ) if neither (a) nor (c) applies. 


Example 2.19 
[x].xy = S(Ix].XY([x].y) by 2.18(1) 
= SI(Ky) by 2.18 (b) and (a). 
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2B Weak reduction 


2.2. The combinators I, B, W, C, Phi, Psi. 

There are two given combinators S and K such that 
(S) Sxyz = xz(yz) 

(К) Kxy=x 


The combinator 116 the identity combinator with the property 
(1) Ix =x. 
ouch a combinator may be obtained in the following way, 
Ix = x = Кх(Кх) by (К) 

=SKKx _ by (5) 
so if we define 

|-5КК, 

| will satisfy rule (1). 
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2B Weak reduction 


There are two given combinators S and K such that 
(S) Sxyz = xz(yz) 
(К) Kxy=x 


To define B, we note that 
Bfgx=f(gx) 
= Kix(gx) by (K) 
= S(Kf)gx бу (S) 
= KSf(Kf)gx by (К) 
= S(KS) Kfgx by (S) 
o0 B can be defined by 

B -5(К5)К. 
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2B Weak reduction 


There are two given combinators S and K such that 

(S) Sxyz = xz(yz) 

(К) Kxy=x 

Given a function f of two arguments, the combinator W gives the 
function Wf obtained from f by identifying its arguments, i.e. 

(W) Мх = їхх. 

To define W we proceed as follows, 

Hence, 
Wfx-fxx 

= fx(Ix) (by 
- Sf Ix (by 
= КІР х (by 
= SS(Kl)fx (by 


I 
S) 
K) 
S) 

W=SS(KI). 
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2B Weak reduction 


The combinator C permutes the arguments of a binary function f, 
(C) Cfxy = фух 
Omitting reference to the rules used, we obtain 
Сїху =fyx 
= Sf(Kx)y 
=B(Sf)Kxy 
=BBSfKxy 
= BBSf(KKf)xy 
= S(BBSY(KK)fxy 
so C is defined by 
C = S(BBS)(KK). 
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2B Weak reduction 


The combinator Phi can be considered as a generalization of S or B. 
It performs two-place composition i.e. 
(Phi) Phi fghx = f(gx) (hx). 
It is left to the reader to verify that an acceptable definition of Phi is 
Phi = B(BS)B. 
The combinator Psi has a similar property, 
(Psi) Psi igxy = f(gx)(gy) 
and is defined by 
Psi = B(BW(BC))(BB(BB)). 


